Back

JNCI: Journal of the National Cancer Institute

Oxford University Press (OUP)

Preprints posted in the last 90 days, ranked by how well they match JNCI: Journal of the National Cancer Institute's content profile, based on 16 papers previously published here. The average preprint has a 0.02% match score for this journal, so anything above that is already an above-average fit.

1
Integrating Lung Tissue-based Transcriptome-Wide Association Study with Single-cell RNA-sequencing Uncovers Susceptibility Genes and Cell Types Underlying Lung Cancer Risk

Xu, S.; Shi, J.; Li, B.; Shu, X.-O.; Tao, R.; Cai, H.; Wen, W.; Deppen, S. A.; Zhou, M. X.; Xu, L.; Wang, J.; Wu, J.; Yang, Y.; Guo, X.; Zheng, W.; Long, J.; Cai, Q.

2026-03-23 epidemiology 10.64898/2026.03.19.26348840 medRxiv
Top 0.1%
19.2%
Show abstract

Genome-wide association studies (GWASs) have identified approximately 100 loci for lung cancer, but potential causal genes remain largely unknown. To address this, we conducted a lung tissue-specific transcriptome-wide association study (TWAS). Gene expression prediction models were constructed using data of adjacent normal lung tissues from our Vanderbilt Thoracic Biorepository (N=314) and normal lung tissues from the GTEx (N=466) and then applied to our lung cancer GWAS meta-analysis (55,174 cases and 1,294,174 controls). We identified 109 unique risk genes for lung cancer and its histological subtypes. Of them, 71 unique genes were novel discoveries, and 13 unique genes reside in novel loci. Smoking-conditional analysis revealed that 52 unique genes are unrelated to smoking behavior. Seven unique genes showed cell-type-specific colocalization within potential risk cell types, including the alveolar type I and II, dendritic, and natural killer cells. Seventeen unique genes are targeted of 58 drugs that have been approved or in Phase II or III trials. In addition, 22 unique potential causal genes were supported by both Mendelian randomization and colocalization. Functional validation identified three genes through in vitro knockdown experiments. Our study identified new lung cancer candidate risk genes and offered insights into lung cancer biology and future translational utilities.

2
Novel risk models based on screening history results and timing of lung cancer diagnosis: Post hoc analysis of the National Lung Cancer Screening Trial

Haddan, S.; Waqas, A.; Rasool, G.; Schabath, M. B.

2026-04-14 epidemiology 10.64898/2026.04.12.26350705 medRxiv
Top 0.1%
17.7%
Show abstract

Background: Our group previously reported that lung cancer (LC) screening history results and subsequent timing of diagnosis are associated with significant differences in survival outcomes. As a follow-up study, we sought to develop novel personalized risk models that considered screening history for incidence cancers, interval LCs, and prevalence LCs. Methods: Using data from the CT-arm of the NLST, four independent case-control analyses were conducted to develop parsimonious risk models. Controls (n=26,038) were those never diagnosed with LC. The four LC case groups were 270 prevalence LCs, 44 interval LCs, 206 screen-detected LCs (SDLCs) that had a baseline positive screen, and 164 SDLCs that had a baseline negative screen. For each case-control analysis, univariable analyses identified statistically significant covariates from 48 variables and then significant covariates were included into a stepwise backward selection approach to identify a model with the most informative covariates. Results: For prevalence LCs, the model (AUC=0.711) included age, pack-years smoked, BMI, smoking status, smoking onset age, personal history of cancer, family history of LC, alcohol consumption, and milling occupation. For interval LCs, the model (AUC=0.734) included age, smoking status, smoking onset age, cigar smoking, marital status, and asbestos occupation. For baseline positive SDLCs, the model (AUC=0.685) included age, pack-years smoked, BMI, emphysema, chemicals/plastics exposure, and milling occupation. For baseline negative SDLCs, the model (AUC=0.701) included age, pack-years smoked, BMI, smoking status, emphysema, sarcoidosis, and sandblasting occupation. Conclusions: Besides smoking and age, which are inclusion criteria for screening, these models identified other important risk factors which could be used to provide personalized LC risk assessment and screening management.

3
Five-Domain Accelerometer-Derived Behavioral Exposome and Incident Cancer Risk in UK Biobank

Ni Chan Chin (Chengqin Ni), M.; Berrio, J. A.

2026-04-12 epidemiology 10.64898/2026.04.07.26350369 medRxiv
Top 0.1%
12.7%
Show abstract

BackgroundAccelerometer-derived behavioral phenotype captures multidimensional aspects of human behavior extending well beyond physical activity, encompassing light exposure, step counts, physical activity patterns, sleep, and circadian rhythms. Whether these five domains constitute a unified behavioral architecture underlying cancer risk and whether circadian organization and light exposure confer incremental predictive value beyond movement volume alone remains to be comprehensively established. MethodsWe conducted an accelerometer-wide association study (AWAS) encompassing the complete accelerometer-derived behavioral exposome across five behavioral domains in UK Biobank participants with valid wrist accelerometry data. Incident solid cancers were designated as the primary endpoint, with prespecified site-specific solid cancers and hematological malignancy as secondary outcomes. Cox proportional hazards models with age as the timescale were used. The minimal covariate set served as the primary reporting tier, followed by sensitivity analyses additionally adjusting for adiposity/metabolic factors, independent activity patterns, shift work history, and accelerometry measurement quality. Nominal statistical significance was defined as two-sided P < 0.05 ResultsAmong 89,080 participants, 6,598 incident solid cancer events were observed over a median follow-up of 8.39 years. In the minimally adjusted model, the pan-solid-tumor association atlas was dominated by signals from activity volume, inactivity fragmentation, and circadian rhythm. Higher overall acceleration (HR per SD: 0.91, 95% CI: 0.89-0.94) and higher daily step counts (HR: 0.93, 95% CI: 0.90-0.95) were independently associated with reduced solid cancer risk, while inactivity fragmentation metrics were consistently linked to higher risk. Notably, circadian rhythms, most prominently cosinor mesor (Midline Estimating Statistic of Rhythm under cosinor model), emerged as leading inverse risk signals, underscoring the independent contribution of circadian behavioral architecture. Site-specific analyses revealed pronounced heterogeneity across tumor sites. Lung cancer exhibited a robust inverse activity-risk gradient, while breast cancer showed reproducible associations with MVPA. Most strikingly, nocturnal light exposure demonstrated a tumor-site-specific association confined to pancreatic cancer, a signal absent across all other sites examined. Associations for uterine cancer were predominantly inactivity-related and substantially attenuated following adjustment for adiposity and metabolic factors. ConclusionsAcross five accelerometer-derived behavioral domains, solid cancers as a whole were most consistently associated with a high-movement, low-fragmentation, and circadian-coherent behavioral profile. While site-specific heterogeneity exists, the broad cancer risk landscape is dominated by movement volume, inactivity fragmentation, and circadian rhythmicity. Light exposure, although more localized in its contribution, demonstrates a potentially novel and specific association with pancreatic cancer risk. These findings support a five-domain behavioral exposome framework for cancer epidemiology and, importantly, position circadian rhythm integrity and nocturnal light exposure as critically understudied dimensions warranting dedicated mechanistic investigation.

4
Tumor-Specific Divergence of Tumor-Associated Macrophage Prognostic Effects Across TCGA Lung and Melanoma Cohorts

Lehrer, S.; Rheinstein, P.

2026-02-24 oncology 10.64898/2026.02.23.26346900 medRxiv
Top 0.1%
10.1%
Show abstract

BackgroundTumor-associated macrophages (TAMs) display context-dependent functional polarization, but whether their prognostic impact is consistent across tumor types remains unclear. MethodsWe analyzed RNA-sequencing and clinical data from The Cancer Genome Atlas (TCGA) lung adenocarcinoma (LUAD; n=648), lung squamous carcinoma (LUSC; n=623), and melanoma (SKCM; n=466). Cox proportional hazards models adjusted for age and AJCC stage evaluated per-standard deviation (SD) expression of TAM markers (FOLR2, TREM2) and T-cell markers (CD8A, CXCL9). Cross-histology interaction terms tested divergence between LUAD and LUSC. ResultsIn melanoma, higher FOLR2 (HR 0.87), TREM2 (HR 0.83), CD8A (HR 0.69), and CXCL9 (HR 0.67) independently predicted improved survival. LUAD showed largely neutral macrophage effects. In contrast, LUSC demonstrated an adverse association for FOLR2 (HR 1.28). Interaction analysis confirmed significant divergence for FOLR2 and TREM2 between LUAD and LUSC. ConclusionsTAM-associated prognostic effects reverse by tumor histology, supporting tumor-context-dependent macrophage polarization and informing macrophage-targeted therapeutic strategies.

5
YBX3 overexpression in mesothelioma drives aberrant cell proliferation

Rubio, A.; Harvey, R. F.; Craxton, A.; Southwood, M.; Ficken, C.; Franco, C.; Kalmar, L.; Munoz, G.; Powley, I.; Kamrad, S.; Guan, R.; Fernandez-Antoran, D.; Patil, K. R.; Le Quesne, J. P.; MacFarlane, M.; Willis, A. E. E.

2026-02-12 molecular biology 10.64898/2026.02.11.705262 medRxiv
Top 0.1%
8.6%
Show abstract

Malignant pleural mesothelioma (MpM) is a lethal tumour closely linked to asbestos exposure and is a cancer of unmet clinical need with no known oncogenic drivers. Recent advancements in technologies to identify RNA-binding proteins (RBPs) has uncovered an emerging role for RBP-RNA interactions in cancer progression and we therefore assessed changes in the RBPome of patient-derived MpM cell lines. We identify over 350 RBPs showing altered RNA binding, with functions consistent with key cancer hallmarks, and discovered YBX3 as a potential oncoprotein driving cell proliferation in MpM. Mechanistically we show the impact of YBX3 on cell growth is achieved through its control of the expression of the amino acid transporter SLC7A5/LAT1, with increased amino acid uptake increasing protein synthesis rates. Notably, we show the inhibition of cell growth by YBX3 deletion is recapitulated by the clinically-relevant SLC7A5/LAT1 inhibitor JPH203. Finally, we demonstrate that JPH203 sensitizes MpM cells to radiotherapy, which could provide a promising therapeutic strategy for MpM. TeaserHigher levels of YBX3 expression in mesothelioma increases cell proliferation and protein synthesis rates by upregulation of amino acid uptake.

6
Pre-diagnostic lipid metabolites are enriched in men who develop advanced prostate cancer: a nested case-control study

Graff, R. E.; Fuller, H.; Wilson, K. M.; Dickerman, B. A.; Chan, J. M.; Kantoff, P. W.; Feng, X.; Clish, C. B.; Vander Heiden, M. G.; Darst, B. F.; Ebot, E. M.; Mucci, L. A.

2026-03-13 epidemiology 10.64898/2026.03.12.26348193 medRxiv
Top 0.1%
7.5%
Show abstract

Few studies with pre-diagnostic samples have estimated associations between circulating metabolites and risk of advanced prostate cancer. We performed untargeted metabolomic profiling of pre-diagnostic blood samples from 212 advanced prostate cancer cases (stage [&ge;]T3b or lethal during follow-up) and 212 matched controls from the Health Professionals Follow-up Study. 243 metabolites were assayed using liquid chromatography-tandem mass spectrometry (Broad Institute) and met quality control standards. We used multivariable conditional logistic regression to generate odds ratios (OR) and 95% confidence intervals (95%CI) for associations between individual metabolites and risk of advanced prostate cancer, and conducted metabolite set enrichment tests to identify metabolite classes enriched in advanced prostate cancer. Subgroup analyses were conducted by body mass index (BMI) and time between blood draw and diagnosis. Levels of 16 lipid species were nominally associated with advanced prostate cancer at p<0.05, though none were statistically significant after multiple testing correction. The strongest signals were for C56:1 triacylglycerol (TAG; OR: 1.34, 95%CI: 1.07-1.67) and C38:4 diacylglycerol (DAG; OR: 1.27, 95%CI: 1.04-1.55). Enrichment analyses revealed six metabolite classes associated with advanced prostate cancer after multiple testing adjustment, the top four of which were DAGs and TAGs: DAGs overall (P=3.4E-07), unsaturated DAGs (P=5.9E-07), unsaturated TAGs (P=2.3E-06), and TAGs overall (P=2.4E-06). 43 metabolites were nominally associated with advanced prostate cancer among individuals with BMI <25 kg/m2; only three demonstrated nominal associations in individuals with BMI [&ge;]25 kg/m2. These findings suggest associations between circulating pre-diagnostic lipid levels and aggressive prostate cancer risk, particularly in lean individuals.

7
Sex-stratified Integrated Analysis of US lung Cancer Mortality, 1994-2020

Islam, M. R.; Sayin, S. I.; Islam, H.; Shahriar, M. H.; Chowdhury, M. A. H.; Tasmin, S.; Konda, S.; Siddiqua, S. M.; Ahsan, H.

2026-03-06 oncology 10.64898/2026.03.01.26347234 medRxiv
Top 0.1%
7.2%
Show abstract

ImportanceLung cancer mortality in the United States has fallen substantially in recent decades, yet the relative influence of behavioral, environmental, socioeconomic, and therapeutic factors and their sex specific contributions remains unclear. Understanding these drivers is essential to sustain progress and reduce persistent disparities. ObjectiveTo quantify how behavioral, environmental, socioeconomic, and therapeutic determinants collectively shaped US lung cancer mortality from 1994 to 2020, assess sex specific differences, and forecast mortality trajectories through 2030 using an integrated machine learning framework. Design, Setting, and ParticipantsEcological time series study using publicly available national data from 1994 to 2020. Sex stratified analyses were conducted integrating lung cancer mortality, smoking prevalence, fine particulate matter PM2.5 exposure, Human Development Index HDI, per capita healthcare expenditure, healthcare inflation, insurance coverage, income inequality, and annual drug approvals. ExposuresBehavioral smoking, environmental PM2.5, socioeconomic HDI health expenditure inflation, uninsurance inequality, and therapeutic drug approval indicators. Main Outcomes and MeasuresAge-standardized lung cancer mortality per 100000 population. Temporal changes were modeled using Joinpoint regression. Concurrent associations were assessed using multivariable and elastic net regression, and forecasts were estimated with AutoRegressive Integrated Moving Average models with exogenous variables ARIMAX. ResultsFrom 1994 to 2020, mortality declined by 59 percent in men, from 52.9 to 21.7 per 100000, and by 40 percent in women, from 26.7 to 15.9 per 100000, with faster declines after 2015. Smoking and PM2.5 decreased by more than 45 percent but remained strongly correlated with mortality. In elastic net models, PM2.5 was the strongest predictor for men, while smoking was the strongest predictor for women. Per capita expenditure and HDI ranked higher for men, while uninsurance and income inequality were strong predictors for women. Mortality declines occurred during periods of major approvals of lung cancer drugs. Forecasts suggest continued but slower declines through 2030, with projected rates of 20.2 and 14.9 deaths per 100000 in men and women, respectively. Conclusions and RelevanceSex specific declines in lung cancer mortality reflect different dominant correlates, with air pollution more important in men and smoking more important in women, while socioeconomic conditions and therapeutic advances also influence trends. Continued tobacco control, improved air quality, and equitable access to screening and modern treatment are essential to sustain further reductions in mortality.

8
CT-Based Deep Foundation Model for Predicting Immune Checkpoint Inhibitor-Induced Pneumonitis Risk in Lung Cancer

Muneer, A.; Showkatian, E.; Kitsel, Y.; Saad, M. B.; Sujit, S. J.; Soto, F.; Shroff, G. S.; Faiz, S. A.; Ghanbar, M. I.; Ismail, S. M.; Vokes, N. I.; Cascone, T.; Le, X.; Zhang, J.; Byers, L. A.; Jaffray, D.; Chang, J. Y.; Liao, Z.; Naing, A.; Gibbons, D. L.; Vaporciyan, A. A.; Heymach, J. V.; Suresh, K. S.; Altan, M.; Sheshadri, A.; Wu, J.

2026-04-23 oncology 10.64898/2026.04.21.26351428 medRxiv
Top 0.1%
6.4%
Show abstract

Background: Immune checkpoint inhibitors (ICIs) have revolutionized cancer therapy but can cause serious immune-related adverse events (irAEs), with pneumonitis (ICI-P) being among the most severe. Early identification of high-risk patients before ICI initiation is critical for closer monitoring, timely intervention, and improved outcomes. Purpose: To develop and validate a deep learning foundation model to predict ICI-P from baseline CT scans in patients with lung cancer. Methods: We designed the Checkpoint-Inhibitor Pneumonitis Hazard EstimatoR (CIPHER), a deep learning foundation model that combines contrastive learning with a transformer-based masked autoencoder to predict ICI-P from baseline CT scans in patients with lung cancer. Using self-supervised learning, CIPHER was pre-trained on 590,284 CT slices from 2,500 non-small cell lung cancer (NSCLC) patients to capture heterogeneous lung parenchymal patterns. After pre-training, the model was fine-tuned on an internal NSCLC cohort for ICI-P risk prediction, using images from 254 patients for model development and 93 patients for internal validation. We compared CIPHER with classical radiomic models and further evaluated it on an external NSCLC cohort of 116 patients. Results: In the internal immunotherapy cohort, CIPHER consistently distinguished patients at elevated risk of ICI-P from those without the event, with AUCs ranging from 0.77 to 0.85. In head-to-head benchmarking, CIPHER achieved an AUC of 0.83, outperforming the radiomic models. In the external validation cohort, CIPHER maintained strong performance (AUC = 0.83; balanced accuracy = 81.7%), exceeding the radiomic models (DeLong p = 0.0318) and demonstrating higher specificity without sacrificing sensitivity. By contrast, the radiomic model showed high sensitivity (85.0%) but markedly lower specificity (45.8%). Confusion matrix analysis confirmed the robust classification performance of CIPHER, correctly identifying 80 of 96 non-ICI-P cases and 16 of 20 ICI-P cases. Conclusions: We developed and externally validated CIPHER for predicting future risk of ICI-P from pre-treatment CT scans. With prospective validation, CIPHER may be incorporated into routine patient management to improve outcomes.

9
Weight Trajectories and Cancer Risk: A Pooled Cohort Study

Nilsson, A.; da Silva, M.; Le, H. T.; Haggstrom, C.; Wahlstrom, J.; Michaelsson, K.; Trolle Lagerros, Y.; Sandin, S.; Magnusson, P. K.; Fritz, J.; Stocks, T.

2026-04-24 epidemiology 10.64898/2026.04.23.26351553 medRxiv
Top 0.1%
6.3%
Show abstract

Excess body weight has been associated with increased cancer risk, but the role of weight change across adulthood remains unclear. We examined body weight trajectories from ages 17 to 60 and their associations with site-specific cancer incidence. Data were based on the ODDS study, a pooled, nationwide cohort study in Sweden, with data on weight spanning 1911 to 2020, and cancer follow-up through 2023. Weight trajectories were estimated with linear mixed effects models in individuals with at least three weight measurements. Cox regressions estimated hazard ratios for associations between weight trajectories and established and potentially obesity-related cancers. Fifth versus first quintile of weight change was associated with many cancers, most strongly with esophageal adenocarcinoma in men (HR 2.25; 95% CI 1.66-3.04), liver cancer in men (HR 2.67; 95% CI 2.15-3.33), endometrial cancer in women (HR 3.78; 95% CI 3.09-4.61), and pituitary tumors in both sexes (men: HR 3.13 [95% CI 2.13-4.61]; women: HR 2.13 [95% CI 1.41-3.22]). Associations varied by sex and age. Heavier weight at age 17 years and earlier obesity onset were also associated with higher cancer incidence. These findings highlight the importance of a life-course approach to weight management and support sex- and age-targeted cancer prevention strategies.

10
Gene-Specific Cancer Patterns in Pathogenic Germline Variant Carriers

Idumah, G.; Ribaudo, I.; Newell, D.; Ni, Y.; Arbesman, J.

2026-01-30 oncology 10.64898/2026.01.27.26344970 medRxiv
Top 0.1%
4.9%
Show abstract

BackgroundWe previously reported that >5% of the population carries pathogenic or likely pathogenic variants (P/LPVs) in key cancer susceptibility genes. However, gene-specific cancer prevalence, spectrum, burden, lifetime risk, comorbidity, and the risk associated with autosomal recessive (AR) genes among carriers remain incompletely defined. MethodsWe analyzed 72 cancer susceptibility genes in the All of Us dataset (N=633,547), including 287,076 participants with both genomic and electronic health record data. Cancer diagnoses were identified using SNOMED codes and grouped into 35 categories. Associations between P/LPVs and overall and site-specific cancer risk were evaluated using regression models adjusted for age, sex, race, and ethnicity. ResultsAmong genes with [&ge;]10 unique carriers, cancer prevalence was highest for MEN1 (80%), followed by TP53 (57.7%), MLH1 (48.4%), and MSH2 (47.2%). Carriers of P/LPVs in BRCA1, BRCA2, MLH1, APC, NF1, PTEN, and PALB2 had significantly earlier cancer diagnosis compared to non-carriers. Cancer prevalence was markedly higher in BRCA1 and BRCA2 carriers who are also mono-allelic MUTYH carriers (75% and 45.5%, respectively) compared with BRCA1 and BRCA2 alone (43.2% and 36.5%). Adjusted survival analysis showed increased cancer risk for MLH1 (OR=6.08), PTEN (OR=5.80), and MSH2 (OR=5.19). Novel associations included MITF with anal/perianal and prostate cancer; BLM with ovarian and soft tissue/sarcoma; WRN with gynecologic cancer (NOS); and FH with hematologic malignancy. ConclusionsThis population-based analysis defines gene-specific cancer prevalence, spectrum, and risk, including contributions from AR variants, in the U.S. population. These findings support more precise genetic testing, screening, and risk stratification for individuals carrying inherited P/LPVs.

11
Time of Day as an Unmeasured Confounder in Oncology Trials

Somer, J.; Benor, G.; Alpert, A.; Perets, R.; Mannor, S.

2026-03-06 oncology 10.64898/2026.03.05.26347742 medRxiv
Top 0.1%
4.9%
Show abstract

A recent randomized clinical trial in non-small cell lung cancer1 confirms what numerous observational studies have reported - time-of-day (ToD) may dramatically influence treatment outcomes in cancer patients2-9. In this recent trial median overall survival (OS) decreased from 28 months in the early ToD arm to 16.8 months in the late ToD arm. We raise the concern that clinical trial outcomes may be influenced by seemingly minor biases in treatment time across arms. We also suggest that by measuring or randomizing treatment-time in clinical trials, we may identify beneficial ToD-dependent treatments that would otherwise be overlooked.

12
Postmastectomy Radiotherapy in pN1 Breast Cancer: Survival Outcomes and Prognostic Factors From a Single-Institution Cohort

Narasimhan, R. M.; Saini, A. S.; Samimi, K.; Ogobuiro, I.; Zhao, X.; Han, S.; Takita, C.; Taswell, C. S.

2026-02-02 oncology 10.64898/2026.01.27.26344082 medRxiv
Top 0.1%
4.7%
Show abstract

Structured AbstractO_ST_ABSPurpose/ObjectivesC_ST_ABSThe role of postmastectomy radiotherapy (PMRT) in patients with pathologic N1 (pN1) breast cancer, including triple-negative breast cancer (TNBC), remains controversial in the era of modern systemic therapy. We evaluated the association between PMRT and recurrence-free survival (RFS) and overall survival (OS) and identified prognostic factors in a contemporary single-institution pN1 cohort. Materials/MethodsWe retrospectively reviewed female patients with pT1-2N1M0 breast cancer treated with mastectomy between 2016 and 2022. RFS and OS were estimated using Kaplan-Meier methods and compared by PMRT status with log-rank testing. Univariable Cox proportional hazards models assessed associations between clinical factors--including tumor laterality, receptor subtype (TNBC vs non-TNBC), nodal burden, and adjuvant therapies--and survival outcomes, with subgroup analyses by PMRT status and receptor subtype. ResultsFifty-seven patients were included; 22 (38.6%) received PMRT. With a median follow-up of 85 months, PMRT was not associated with improved RFS (median 133 vs 120 months; p=0.256) or OS (not reached vs 195 months; p=0.154). Hormone therapy was significantly associated with improved RFS (HR 0.43; p=0.026) and OS (HR 0.13; p=0.003), while having 2-3 positive lymph nodes predicted worse RFS (HR 2.86; p=0.007). No significant differential benefit from PMRT was observed in patients with TNBC or non-TNBC disease. ConclusionsPMRT was not associated with a survival benefit in this pN1 cohort, including patients with TNBC. Interpretation is limited by modest sample size and statistical power. Outcomes appeared driven by tumor biology, nodal burden, and systemic therapy, supporting individualized PMRT decision-making.

13
Lung adenocarcinoma WHO histological classes contain distinct immune cell profiles

Nastase, A.; Olanipekun, M.; Starren, E.; Willis-Owen, S. A. G.; Mandal, A.; Domingo-Sabugo, C.; Morris-Rosendahl, D.; Lim, E.; Liang, L.; Nicholson, A. G.; Moffatt, M. F.; Cookson, W. O. C.

2026-03-26 genetic and genomic medicine 10.64898/2026.03.24.26348030 medRxiv
Top 0.1%
4.6%
Show abstract

Lung adenocarcinoma (LUAD) is classified internationally into six histological subtypes that predict clinical outcomes. Mutation analyses identify targets but provide less prognostic information than histological appearances. Immunotherapy in LUAD is constrained by the unpredictable immune environment within tumours. We therefore characterised relationships between WHO histological classification, common mutations, and underlying transcriptomic and immune profiles in 89 LUAD cases. Mutation profiles poorly correlated with histology or survival. Global gene expression was structured into 12 modules, identifying different tumour cells and pathways within WHO subtypes. Tumour classes also held distinctive immune cell profiles. Transcripts within high-risk solid tumours indicated enrichment of CD8+ and activated CD4+ T-cells, suggesting responsivity to immunotherapy. Independently from histologic classification, 31 transcripts were strongly associated with survival and were enriched in macrophage and fibroblast derived networks. The results suggest histological subtype stratification and typing for survival-associated markers have the potential to inform clinical trials of LUAD.

14
Systemic mutagen exposures reported by normal kidney cell genomes

Wang, Y.; Knight, W.; Ferreiro-Iglesias, A.; Abedi-Ardekani, B.; Pham, M. H.; Moody, S.; Hooks, Y.; Abascal, F.; Nunn, C.; Fitzgerald, S.; Cattiaux, T.; Gaborieau, V.; Fukagawa, A.; Jinga, V.; Rascu, S.; Sima, C.; Zaridze, D. G.; Mukeria, A. F.; Holcatova, I.; Hornakova, A.; Vasudev, N. S.; Banks, R. E.; Ognjanovic, S.; Savic, S.; Curado, M. P.; Zequi, S. d. C.; Reis, R. M.; Magnabosco, W. J.; Vianna, F.; Silva Neto, B.; Jarmalaite, S.; Zalimas, A.; Foretova, L.; Navratilova, M.; Phouthavongsy, L.; Shire, C.; Attawettayanon, W.; Sangkhathat, S.; Ding, C.; Lawson, A. R. J.; Latimer, C.; Humphre

2026-04-09 genomics 10.64898/2026.04.07.716715 medRxiv
Top 0.1%
4.4%
Show abstract

Lifestyle, environmental and other exposures to exogenous mutagens generate somatic mutations in normal human cells in vivo and increase cancer risk. However, the global repertoire of exogenous mutagen exposures is uncertain. The mutational signatures of mutagens in normal tissues offer opportunities to detect such exposures and survey them at population level. Using single-molecule duplex sequencing of normal kidney (n=319) and blood (n=272) samples from 10 countries, we show that normal kidney cell genomes report an extensive repertoire of somatic mutational signatures. Microdissection of kidney structures revealed that proximal tubules exhibit higher mutation rates than other components of the nephron and most normal cell types despite low cell division rates. This is explained by marked enrichment of mutational signatures due to known exogenous carcinogenic mutagens including the plant-derived aristolochic acids, as well as several signatures of unknown causes including an unknown agent prevalent in Japan (SBS12), and signatures of uncertain origins (SBS40b and SBS40c). The results suggest the existence of multiple, common, systemically circulating mutagens affecting human populations and indicate that the genomes of kidney proximal tubule cells report such exposures with high sensitivity.

15
Genomic ascertainment of PALB2-related cancer predisposition

Stewart, D.; Kim, J.; Haley, J. S.; Li, J.; Sargen, M. R.; Hong, H. G.; Tischkowitz, M.; McReynolds, L. J.; Carey, D. J.

2026-04-04 genetic and genomic medicine 10.64898/2026.04.03.26349984 medRxiv
Top 0.1%
4.4%
Show abstract

PURPOSE To evaluate cancer risk, age-specific penetrance, and mortality associated with heterozygous pathogenic or likely pathogenic (P/LP) germline PALB2 variants identified through genomic ascertainment and to assess modification by family history of cancer. PATIENTS AND METHODS We conducted a case-control study in two large population-based adult cohorts: the UK Biobank (n=469,580) and Geisinger MyCode (n=167,050). Individuals with heterozygous PALB2 P/LP variants were identified via exome sequencing and compared with non-carriers. Cancer diagnoses and vital status were obtained from linked registry and electronic health record data. We used multivariable logistic regression to estimate odds ratios (ORs) for cancer outcomes and Cox proportional hazards models to estimate hazard ratios (HRs) for all-cause mortality. Age-specific cumulative incidence (penetrance) was estimated using Kaplan-Meier methods. Models were adjusted for birth year, sex (when applicable), smoking status, and body mass index; stratified analyses assessed modification by family history of cancer. RESULTS PALB2 P/LP variant prevalence was 1:571 in UK Biobank and 1:940 in MyCode, with the higher prevalence in the UK cohort driven by the PALB2 p.Trp1038Ter founder variant. Compared with non-carriers, heterozygotes had significantly increased odds of any cancer, female breast cancer, pancreatic cancer, and cancers of ill-defined or secondary sites in both cohorts (P < 0.01). Adjusted hazard ratios for any cancer and female breast cancer ranged from 1.7 to 3.6. All-cause mortality was increased among PALB2-heterozygotes (HR 1.61-1.67), and survival after cancer diagnosis was reduced. Family history further modified cancer risk. CONCLUSION Genomic ascertainment of PALB2-heterozygotes identifies elevated risk for multiple cancers and increased mortality, although risks were lower than estimates from familial ascertainment. These findings inform risk management for individuals identified through genomic screening.

16
A multimodal AI biomarker PATH-ORACLE improves prediction of recurrence in stage I lung adenocarcinoma

Kilim, O.; Martinez Ruiz, C.; Pipek, O.; Sztupinszki, Z.; Huebner, A.; Diossy, M.; Prosz, A.; Moore, D.; Jamal-Hanjani, M.; Hackshaw, A.; Fillinger, J.; Moldvay, J.; Csabai, I.; Swanton, C.; Szallasi, Z.

2026-01-30 oncology 10.64898/2026.01.28.26344973 medRxiv
Top 0.1%
4.3%
Show abstract

The standard treatment for stage I lung adenocarcinoma is surgical resection, in most cases without additional systemic adjuvant treatment. A significant proportion of stage I cases recur with a less than 50% 5-year survival rate. There are clinical data suggesting that adjuvant treatment may improve survival in such recurrent cases. However, previously evaluated predictors such as the IASLC grading system from histological sections and transcriptomic profiles have not been sufficiently accurate and consistent for risk stratification and to guide therapeutic interventions. We hypothesized that these previously investigated diverse diagnostic measurements carry complementary information that may provide higher prognostic power when combined. Here we describe a multimodal deep learning method, PATH-ORACLE. This biomarker is built on top of the prospectively validated transcriptomic-based ORACLE score with the addition of routine histological sections processed by pre-trained foundation models. PATH-ORACLE predicts recurrence with an accuracy of over 85% in two independent cohorts. Given further validation this predictor could be used to prioritize stage IB patients for adjuvant chemotherapy in a more consistent fashion. Furthermore, for stage IA cases, PATH-ORACLE, combined with liquid biopsy-based monitoring may help identify high-risk patients suitable for adjuvant targeted therapy. HighlightsO_LIMultimodal AI model (PATH-ORACLE) integrates histology and transcriptomics to predict stage I LUAD recurrence C_LIO_LIPATH-ORACLE outperforms IASLC grading and transcriptomic or image-based models alone C_LIO_LIModel achieves >85% recurrence prediction accuracy across independent international cohorts C_LIO_LIPATH-ORACLE refines risk stratification within both stage IA and IB lung adenocarcinoma C_LIO_LIBiomarker may guide adjuvant therapy selection and surveillance in early-stage disease C_LI

17
Validation of Immunoscore for Prognostic Stratification in HPV-associated Oropharyngeal Cancer: An International Multicenter Study

Nguyen, D. H.; Majdi, A.; Marliot, F.; Houtart, V.; Kirilovsky, A.; Hijazi, A.; Fredriksen, T.; de Sousa Carvalho, N.; Bach, A.- S.; Gaultier, A.- L.; Fabiano, E.; Kreps, S.; Tartour, E.; Pere, H.; Veyer, D.; Blanchard, P.; Angell, H. K.; Pages, F.; Mirghani, H.; Galon, J.

2026-04-11 oncology 10.64898/2026.04.08.26350238 medRxiv
Top 0.1%
4.2%
Show abstract

BackgroundTreatment optimization in HPV-associated oropharyngeal cancer (OPSCC) remains challenging, as recent de-escalation trials have shown limited success. Current patient selection strategies based on smoking history and TNM classification are insufficient, highlighting the need for robust, standardized prognostic biomarkers. We report the first validation of the Immunoscore (IS) for prognostic stratification in HPV-associated OPSCC. Patients and methodsWe analyzed 191 HPV-associated (p16+ and HPV DNA/RNA+) OPSCC patients from an international multicenter cohort (2015-2024), comprising a French monocentric retrospective training cohort (N = 48) and three validation cohorts: French monocentric retrospective (N = 48), French multicenter prospective (N = 50), and US multicenter retrospective (N = 45). IS is a standardized digital pathology assay quantifying CD3lJ and CD8lJ densities in tumor cores and invasive margins, with cut-offs defined in the training cohort and validated across cohorts. Associations with disease-free survival (DFS), time to recurrence (TTR) and overall survival (OS) were assessed, alongside 3RNA-seq and sequential immunofluorescence profiling of immune composition. ResultsMedian age 65; 80% male; 74% smokers; 66% T1-2; 82% N0-1 (AJCC8th). IS-High patients demonstrated superior 3-year DFS in the training and validation cohorts 1-3 (all log-rank P < 0.05). Multivariable analysis identified IS-Low as the strongest independent risk factor for DFS (HR 9.03; 95% CI: 4.02-20.31; P < 0.001). The model combining IS with clinical factors showed higher predictive accuracy for DFS (C-index 0.82) than clinical variables alone (0.7; P < 0.0001). Similar findings were observed for TTR and OS. IS-High tumors showed markedly higher enrichment of lymphoid and myeloid immune cell populations, contrasting with immune-poor signatures in IS-Low tumors. ConclusionsIS is a robust biomarker that outperforms standard clinical variables in both prognostic and predictive accuracy. The enriched cytotoxic immune infiltrate in IS-High tumors explains favorable outcomes and supports their suitability for treatment de-escalation. Prospective validation is warranted.

18
Inherited genetic risk factors in young-onset lung cancer

Esai Selvan, M.; Gould Rothberg, B. E.; Patel, A. A.; Sang, J.; Horowitz, A.; Christiani, D. C.; Klein, R. J.; Gumus, Z. H.

2026-04-15 genetic and genomic medicine 10.64898/2026.04.14.26350822 medRxiv
Top 0.1%
4.2%
Show abstract

Introduction Lung cancer is rare before age 45, and its inherited genetic basis remains poorly defined. Methods We performed whole-genome sequencing in 171 predominantly young-onset lung cancer patients and integrated these data with whole-exome sequencing from six major lung cancer consortia, yielding 9,065 patients. After quality control, analyses focused on 6,545 individuals of European ancestry, the largest ancestral group. We compared the prevalence of rare pathogenic and likely pathogenic (P/LP) germline variants between 186 young-onset (age <45 years) and 6,359 older patients at gene and gene-set levels using Fisher's exact test, stratified by histology, sex, and smoking status. Polygenic risk scores (PRS) derived from common variants were also evaluated. Results Young-onset patients carried a higher burden of rare germline P/LP variants in DNA damage response (DDR) genes (including BRIP1, ERCC6, MSH5), and in cilia-related genes, notably GPR161. At the pathway level, DDR genes were significantly enriched (OR=1.66, p=0.007), with the strongest signal in the Fanconi Anemia pathway and among females (OR=1.96, p=0.01). Enrichment was also observed in inborn errors of immunity pathways, with strongest signals in antibody deficiency and the complement system genes. Young-onset patients additionally exhibited higher lung cancer PRS. Conclusion Young-onset lung cancer exhibits a distinct germline genetic architecture, characterized by enrichment of rare P/LP variants in DDR, cilia-related, and immune pathways, and an elevated lung cancer PRS. These findings support a greater role for inherited susceptibility in early-onset disease and have implications for risk stratification, earlier screening, and precision prevention.

19
Clinical and pathological characteristics of thin cutaneous melanomas with rapid recurrence.

Bhave, P.; Wong, T.; Margolin, K.; Hoeijmakers, L.; Mangana, J.; Vitale, M. G.; Ascierto, P. A.; Maurichi, A.; Santinami, M.; Heddle, G.; Allayous, C.; Lebbe, C.; Kattak, A.; Forchhammer, S.; Kessels, J. I.; Lau, P.; Lo, S. N.; Papenfuss, A. A.; McArthur, G. A.

2026-04-06 oncology 10.64898/2026.04.04.26350182 medRxiv
Top 0.1%
4.0%
Show abstract

Background: Although thin, T1 melanomas have an excellent cure rate with surgery alone, >25% of melanoma deaths originate from thin melanomas (TMs). There is, therefore, an urgent need to improve the identification and management of patients with TMs at high risk of recurrence. Methods: Patients with T1 melanoma and recurrence [&le;] 2 years of diagnosis (T1 rapid group) were compared to patients with T1 melanoma and recurrence [&ge;]10 years after diagnosis (T1 late group). Results: 442 patients from 14 sites were included: 310 and 132 patients in the T1 rapid and late groups, respectively. Median age at primary melanoma diagnosis was 51 years [15-85], 272 (62%) male, 254 (58%) superficial spreading and 101 (23%) head/neck primary. The majority (73%) of recurrences in the T1 rapid group were locoregional. Using univariable logistic regression analysis, age >65 years (p<0.0001), lentigo maligna (LM) melanoma subtype (p=0.025), head/neck primary site (p=0.0065), mitoses [&ge;]1/mm2 (p=0.0181) and ulceration (p=0.0087) were significantly associated with T1 rapid recurrence compared to T1 late recurrence. Using multivariable analysis, age >65 years (p=0.0010), mitoses [&ge;]1/mm2 (p=0.049) and ulceration (p=0.037) remained significant. Conclusions: Rapid recurrence of TM is associated with age >65 years, LM subtype, head/neck primary site, mitoses [&ge;]1/mm2 and ulceration.

20
Enhanced expression of HLA-DR and CD69 on peripheral CD4+ T cells predicts better clinical outcomes in cutaneous melanoma

Tomas, A.; Maximino, J.; Nunes, H.; Salvador, R.; Luis, R.; Brito, C.; Saraiva, D. P.; Gouveia, E.; Pereira, C.; Goncalves, F.; Farricha, V.; Carvalho, E. L.; Moura, C.; Passos, M. J.; Cristovao-Ferreira, S.; Pereira, P. M.; Cabral, M. d. G.; Pojo, M.

2026-03-26 oncology 10.64898/2026.03.24.26349163 medRxiv
Top 0.1%
4.0%
Show abstract

BackgroundCutaneous melanoma (CM) is an aggressive skin cancer with rising incidence, representing a growing public health concern. Despite the remarkable success of immune-checkpoint inhibitors (ICIs) in the management of advanced disease, mortality remains high due to therapy resistance. Identifying reliable prognostic and predictive biomarkers is therefore essential to improve patient stratification, optimize treatment selection, and minimize unnecessary toxicity. MethodsWe comprehensively profiled the circulating immune landscape of 54 treatment-naive CM patients by integrating flow cytometry immunophenotyping with clinicopathological data, and performed tumor gene expression analysis in a subset of 26 patients. ResultsElevated HLA-DR and CD69 expression on circulating CD4+ T cells, together with reduced circulating CD8+ T cell frequency, emerged as candidate prognostic biomarkers associated with improved survival. Prognostic models combining these immune variables with clinical covariates accurately stratified patients by overall survival (89.5% sensitivity, 72.7% specificity; AUC = 0.872, p < 0.0001) and progression/recurrence risk (75% sensitivity and 71.4% specificity; AUC = 0.763, p = 0.001). In a subset of 43 patients subsequently treated with ICIs, elevated baseline HLA-DR and CD69 expression on circulating CD4+ T cells was also associated with therapeutic benefit. A predictive model integrating these markers with clinical covariates achieved good discriminatory performance (65.2% sensitivity, 88.9% specificity; AUC = 0.775, p = 0.0027). Tumor gene expression profiling supported the role of IFN-{gamma}-related signatures, previously linked to ICI response, as complementary prognostic and predictive tools. ConclusionThese findings highlight systemic CD4+ T cell activation status as a promising, easily measurable biomarker in CM, laying the foundation for future strategies to refine patient stratification and guiding immunotherapy decisions.